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CONJUGATES THAT CONTAIN THE HOMEODOMAIN OF ANTENNAPEDIA 



5 The present invention relates to a novel conjugate, particularly, but not exclusively, 
in the form of a fusion protein or protein/nucleic acid complex. 

Gene therapy provides the potential to permanently cure selected genetic diseases. 
However, a major obstacle is the effective delivery of the gene or protein of interest 

10 to the target site. A variety of viral and non-viral vectors have been developed to 
deliver genes or gene products to various cells, tissues and organs by ex vivo or in 
vivo strategies. Among viral-based vectors, retroviruses, adenoviruses, adeno- 
associated viruses and herpes viruses have been most extensively studied. Among 
non-viral-based vectors, liposomes have been used to introduce plasmid DNA 

15 directly into animals. However, one of the main challenges of gene therapy remains 
the design of -effective delivery systems. 

The gene antennapedia (Antp) encodes a transcriptional factor that has been shown to 
control antero-posterior morphogenesis in Drosophila embryo. The protein sequence 

20 of antennapedia is characterised by the presence of a 60 amino acids motif 
(homeodomain) that binds to specific DNA target elements. Antennapedia 
homologues have been found in nearly all multicellular organisms and show a very 
high degree of amino acid sequence identity. The human and drosophila 
antennapedia proteins differ in the sequence of the homeodomain only for one 

25 conservative amino acid substitution. 

It has been observed that antennapedia and its homeodomain are able to translocate 
across the cytoplasmic membrane of mammalian cells. The translocation does not 
depend on cell endocytosis and it has been reported that translocation occurs at both 
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4°C and 37° C. Homeodomain synthetic peptides made of D amino acids are also able 
to cross the cytoplasmic membrane. This finding would rule out the possibility that 
Antp is translocated through a receptor mediated mechanism. This property has been 
exploited to vehiculate small viral sequences into the cytoplasm of cultured cells as 
well as to elicit an MHC class I restricted cytotoxic immune response against the 
nucleoprotein of the influenza virus. However, to date the homeodomain of Antp has 
only been used to transport small synthetic peptides. Schutze-Redehneier M-P et al 
(1 996) Introduction of Exogenous Antigens into the MHC Class I Processing and 
Presentation Pathway by Drosophila Antennapedia Homeodomain Primes Cytotoxic 
T Cells In Vivo. The Journal of Immunology 650-655 mentions that the 
homeodomain of the antennapedia molecule can be used to deliever up to 50 
additional amino acids to the cytoplasm. Using 16 amino acids of the third helix they 
mention that fusion peptides containing up to 94 amino acids could be prepared, but 
this is not demonstrated. Further the thrust of the disclosure is only towards delivery 
of synthetic peptides. 

We have now found that the homeodomain of Antp can be used to translocate 
proteins, including non-synthetic proteins. One of the key advantages of the present 
invention is that is can be used to translocate functional and regulatory proteins. This 
is surprising and very important, particularly for medical applications. 

The ability to deliver the expression product of a gene of interest directly into a cell 
has wide applicability, particularly in the medical field. We have now also 
surprisingly found that the homeodomain is able to translocate nucleic acid. This is 
especially advantageous for gene therapy applications. 



Thus according to one aspect of the present invention there is provided a 
comprising: 
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(a) a first region comprising the homeodomain of antennapedia or a variant 
thereof; and 

(b) a second region not naturally associated with the first region; 
and wherein at least the first region is non-denatured. 

According to one embodiment the first and second regions are associated via for 
example a disulfide bond, or the second region may comprise a nucleic acid binding 
domain, preferably further comprising nucleic acid. This embodiment may be seen as 
a protein/nucleic acid complex. 

According to a further embodiment the conjugate is in the form of a fusion protein. 
In this embodiment, preferably the second region is a functional or regulatory 
protein, or an antigen. 



15 According to a further aspect of the present invention there is provided a conjugate 
comprising: 

(a) a first region comprising the homeodomain of antennapedia or a variant 
thereof operably linked to 

(b) a second region not naturally associated with the first region comprising a 
20 protein of at least 100 amino acids. 

By "operably linked" we include that the first and second region are linked such that 
the second region is able to translocate a cell membrane. 

25 Preferably the whole conjugate is non-denatured. 

By non-denaturing we do not necessarily imply a specific non-denaturing step; 
although this may be the case. 
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Denaturation is the process by which the three-dimensional shape of the protein 
molecule is changed from its native state without rupture of peptide bonds. It can 
also include disulfide bond rupture or chemical modification of certain groups in the 
protein if these processes are also accompanied by changes in its overall three- 
5 dimensional structure. 

We also do not exclude renatured proteins or nucleic acid in which the denatured 
protein or nucleic acid is returned to the conformation it maintained before 
denaturation. Reversible denaturation is generally brought about by disulfide 
1 0 reducing agents, and urea, for nucleic acid by heat and salts. 

For the avoidance of doubt, by native state we mean the form in which the protein or 
nucleic acid occurs in the intact cell, its three-dimensional structure depending on 
formation of the appropriate hydrogen bonds. 

Preferably the first region is at the amino terminal end of the second region. 

According to another aspect of the present invention there is provided nucleic acid 
encoding the conjugate of the present invention. 

According to another aspect of the present invention there is provided an expression 
vector comprising the nucleic acid of the present invention operably linked to a 
promoter. 

According to yet another aspect of the present invention there is provided a host cell 
transformed with the expression vector of the present invention. 



The development of an appropriate procedure for obtaining the conjugate from 
bacterial lysate was important. With conventional methods we found that expo 
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to denaturing reagents dramatically affect the translocation property of the Antp 
homeodomain. We also found that small changes in pH and differences in osmolarity 
affected its translocation property. 

5 We have now found a general method for preparing the conjugate of the present 
invention, and in particular an appropriate procedure for purifying the first domain 
and fusion proteins from the bacterial lysate. This method allows conjugates to be 
obtained which will translocate a protein of at least 100 amino acids across the cell 
membrane. 

10 

Thus according to another aspect of the present invention there is provided a method 
for preparing a conjugate comprising: 

(i) culturing the host cell according to the present invention under conditions 
15 which provide for the expression of the conjugate from the expression vector within 

the host cell; and 

(ii) recovering the conjugate, which recovery comprises fusing an amino acid tail 
to the conjugate, which tail is capable of binding to at least one substrate and not to 
another substrate, and wherein the conjugate is caused to bind via the tail to at least 

20 one substrate such that components of the host cell do not bind to this substrate; and 
the conjugate is contacted with the other substrate such that the conjugate is not 
bound and remaining components of the host cell are bound to the other substrate. 

According to yet another aspect of the present invention there is provided a method 
25 for preparing a conjugate comprising: 

(i) culturing a host cell, transformed with an expression vector comprising 
nucleic acid, operably linked to a promoter, encoding (a) a first region comprising the 
homeodomain of antennapedia or a mutant thereof; and (b) a second region not 
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naturally associated with the first region comprising an amino acid sequence of at 
least 100 amino acids in length, under conditions which provide for expression of the 
conjugate from the expression vector within the host cell; and 
(ii) recovering the conjugate, which method comprises comprises fusing an 
5 amino acid tail to the conjugate, which tail is capable of binding to at least one 
substrate and not to another substrate, and wherein the conjugate is caused to bind 
via the tail to at least one substrate such that components of the host cell do not bind 
to this substrate; and the conjugate is contacted with the other substrate such that the 
conjugate is not bound and remaining components of the host cell are bound to the 
10 other substrate. 

According to a further aspect of the present invention there is provided a method of 
purifying a conjugate comprising fusing an amino acid tail to the conjugate, which 
tail is capable of binding to at least one substrate and not to another substrate, and 
wherein the conjugate is caused to bind via the tail to at least one substrate such that 
impurities do not bind to this substrate; and the conjugate is contacted with the other 
substrate such that the conjugate is not bound and remaining impurities are bound to 
the other substrate. 

Thus in broad terms the present invention can be seen as the use of a tail attached to 
the conjugate which allows both positive and negative purification steps. 

Preferably the amino acid tail is fused to the carboxy tenninai end of the conjugate. 
Preferably the amino acid tail comprises HHHHHHGS. 

Preferably the substrate is a nickel column or an antibody with affinity for the amino 
acid tail. 
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According to a preferred embodiment the conjugate is consecutively contacted with 
two substrates with which it has affinity via the amino acid tail. 

Thus preferably both the nickel column and antibody are used. They may be used in 
5 any order. 

When the second region is a DNA binding domain, a complex with nucleic acid may 
be formed by mixing the conjugate formed in accordance with the invention with the 
nucleic acid. 

10 

Further aspects of the present invention include a conjugate prepared by the method 
of the present invention; a pharmaceutical composition comprising the conjugate of 
the present invention and the use of the conjugate of the present invention in the u 
preparation of a medicament for the treatment of a disease or infection. 

15 

In accordance with the invention, standard molecular biology techniques may be 
used which are within the level of skill in the art. Such techniques are fully 
described in the literature. See for example; Sambrook et al (1989) Molecular 
Cloning; a laboratory manual; Hames and Glover (1985 - 1997) DNA Cloning: a 
20 practical approach, Volumes I- IV (second edition); Methods for the engineering of 
immunoglobulin genes are given in McCafferty et al (1996) "Antibody 
Engineering: A Practical Approach 

Various preferred features and embodiments of the present invention will now be 
25 described by way of non-limiting example, and with reference to the accompanying 
drawings in which: 



Figure 1 shows the structure and sequence of the antennapedia homeodomain 
obtainable from Drosophila; and 



p 
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Figure 2 further shows two mutants, designated pAntp 50H and pAntp 40P2. 
First Region 

The first region of the conjugate of the invention may comprise a natural or synthetic 
homeodomain of antennapedia. 

The homeodomain of the Antp gene obtainable from Drosophila is shown in Figure 1 
and in Seq ID No. 1. Sequences homologous to this homeodomain have been isolated 
from other organisms, including vertebrates, mammals and humans, and these are 
included in the present invention. The homeodomain may be prepared using standard 
techniques such as cloning using the procedure described in Joliet et al (1 991) 
Antennapedia homeobox peptide regulates neural morphogenesis. Proc. Natl. acad. 
Sci. 88 1864-1868. As previously indicated differences in the sequences of such 
multicellular organisms are generally conservative in nature. However, this may not 
necessarily be the case and other such sequences are included in the present 
invention, and for example where the sequence identity is about 50% or more e g 
60%, 70%, 80% or 90%, with the sequence obtainable from Drosophila Sequence 
20 identity may be determined using such commercially available programmes as GAP. 

In addition synthetic variants may be used provided that they retain the ability to 
translocate the membrane. Synthetic variants will generally differ from the naturally- 
occurring proteins by substitution, particularly conservative substitution. By 
conservative amino acid changes we mean replacing an amino acid from one of the 
amino acid groups, namely hydrophobic, polar, acidic or basic, with an amino acid 
from within the same group. An example of such a change is the replacement of 
valine by methionine and vice versa. Other examples of conservative substitutions 
may be seen by reference to the following table: 



25 
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ALIPHATIC 


Non-polar 


GAP ; 


ILV 


Polar - uncharged 


CSTM 


NQ 


Polar - charged 


DE 


RK 


AROMATIC 


HFWY 


OTHER 


NQDE 



Such variants may be prepared using standard recombinant DNA techniques such as 
site-directed mutagenesis. Where insertions are to be made, synthetic DNA encoding 
the insertion together with 5' and 3' flanking regions corresponding to the naturally- 
5 occurring sequence either side of the insertion site. The flanking regions will contain 
convenient restriction sites corresponding to sites in the naturally-occurring sequence 
so that the sequence may be cut with the appropriate enzyme(s) and the synthetic 
DNA ligated into the cut. The DNA is then expressed in accordance with the 
invention to make the encoded protein. These methods are only illustrative of the 
10 numerous standard techniques known in the art for manipulation of DNA sequences 
and other known techniques may also be used. 

The ability of a naturally occurring or synthetic sequence to translocate the 
membrane may be tested by routine methods known in the art and illustrated in the 
1 5 accompanying examples. 

Some variants of the homeodomain which retain the ability to translocate the 
membrane have been reported in the art and these are included in the scope of the 
present invention, together with any which become available. 

20 

i" 
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For example, EP-B-0 485 578 to CNRS discloses homeopeptides comprising the 
helix 3 sequence of pAntp, and these are incorporated herein by reference. 

W097/12912 also to CNRS discloses the actual sequence of the helix 3 of pAntp, 
5 and variants thereof. These also are incorporated herein by reference. In particular^ 
the 3 helix is said to have the sequence: 

Arg-Gm-De-Lys-De-Trp-Phe-Gm-Asn-Arg-Arg-Met-Lys-Trp-Lys^ 
The variants are said to have the sequence: 

X1-X2-X3-X4-X5-X6-X7-X8-X9-X10-X11-X12-X13-X14-X15-X16 
10 or 

X16-X15-X14-X13-X12-X11-X10-X9-X8-X7-X6-X5-X4-X3-X2-X1 

wherein each X represents an a-amino acid, with X6 representing trytophane; said 

peptide comprising between 6 and 10 hydrophobic amino acids. 

15 Other variants are desclosed in for example, Gehring W (1 987) Homeo Boxes in the 
Study of Development. Science 236 1245-1252 discloses a homeodomain of 62 
amino acids, i.e. with glu at position 0 and lys at position 6 1 . Bloch-Gallego E at al 
(1993) Antennapedia Homeobox Peptide Enhances Growth and Branching of 
Embryonic Chicken Motoneurons In Vitro. The Journal of Cell Biology 120(2) 485- 
492 discloses a mutant called pAnt P 40P2 that was still able to translocate through the 
motoneuron membrane and to reach the nucleus. In this mutant the leucine and 
threonine residues in positions 40 and 41 were replaced by two proline residues. Le 
Roux et al (1993) Neurotropic activity of the Antennapedia homeodomain depends 
on its specific DNA-binding properties. Proc. Natl. Acad. Sci. 90 9 120-91 24 
25 discloses two mutants pAntp 50A and pAntp 40P2 as shown in Figure 2 which retain 
the ability to translocate through the neuronal membrane. Schutze-Redehneier M-P 
et al (1996) supra disclose that a 1 6 amino acid C-terminal (third helix) segment has 
been used to address oligonucleotides and oligopeptides to the cytoplasm and nuclei 
of cells in culture. 



20 
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However, whilst not wishing to be bound by any theory it is believed that for 
delivering the proteins of the present invention an amino acid sequence of about 60 is 
preferred. 

5 

Cleavable linker region 

Preferably, the first and second regions are linked by a cleavable linker region this 
may be any region suitable for this purpose. Preferably, the cleavable linker region is 

10 a protease cleavable linker, although other linkers, cleavable for example by small 
molecules, may be used. These include Met-X sites, cleavable by cyanogen bromide, 
Asn-Gly, cleavable by hydroxylamine, Asp-Pro, cleavable by weak acid and Trp-X 
celavable by, inter alia, NBS-skatole. Protease cleavage sites are preferred due to the 
milder cleavage conditions necessary and are found in, for example, factor Xa, 

15 thrombin and collagenase. Any of these may be used. The precise sequences are 
available in the art and the skilled person will have no difficulty in selecting a 
suitable cleavage site. By way of example, the protease cleavage region targeted by 
Factor Xa is I E G R. The protease cleavage region targeted by Enterokinase is D D 
DDK. The protease cleavage region targeted by Thrombin is L V P R G. Preferably 

20 the cleavable linker region is one which is targeted by endocellular proteases. 

Second Region 
Fusion protein 

25 

The second region of the conjugate according to the invention may comprise any 
protein sequence of interest (hereinafter POI) which is not naturally associated with 
the first region. Usually this will mean that the POI will be found in nature encoded 
by a gene different from the gene encoding the first region. The second region may 
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be from the same species as the first region, but are present in the conjugate of the 
invention in a manner different from the natural situation, or from a different species. 

The second region of the present invention is preferably at least 100 amino acids in 
length. The present invention is particularly useful for longer sequences, e.g. at least 
150, 200, 300, 400 or 1000 amino acids in length. For the avoidance of doubt the 
term "protein" as used herein also encompasses polypeptides of the required length; 
although by the term "polypeptide" we generally mean sequences of from 2 to 100 ' 
amino acids in length, usually 2 up to 60. 

Protein/nucleic acid complex 



20 



Within the present invention the nucleic acid comprises any nucleic acid of interest 
(hereinafter NOI), which may be e.g. therapeutically active or a reporter gene. The 
1 5 NOI may be DNA or RNA. In one embodiment the NOI is an oligonucleotide. 

When the present invention relates to a protein/nucleic acid complex, the complex 
preferably further comprises a nucleic acid (or DNA) binding domain as part of the 
second region. The nucleic acid binding domain serves to mediate the specific, high 
affinity and non-covalent interaction of the protein component with the nucleic acid 
component. 

The nucleic acid binding domain may be an RNA binding domain, or preferentially, 
a DNA binding domain, e.g. the DNA bicining domain of a transcription factor, 
particularly a yeast or human transcription factor. Preferred is A GAL4 derivable 
domain, mediating the selective binding of the protein of the invention to the DNA 
sequence CGGAGGACAGTCCTCCG (Cavey et al J Mol Biol 209:423, 1 989). Most 
preferably the DNA binding domain consists of GAL4 amino acids 2 to 147. A DNA 
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binding domain may bind to single-stranded or to a double stranded DNA on the 
second domain. 

Another suitable domain would be a histone. 

5 Applications for the conjugates of the present invention include (there may be 
overlap between these applications): 

1 Antigen delivery system. An antigen is any agent that when introduced into 
an immunocompetent animal stimulates the production of a specific antibody or 

1 0 antibodies that can combine with the antigen. However, the antigen may need to be 
combined with a carrier to be able to stimulate antibody production. This is where 
the present invention comes in as it acts as a carrier by transporting the antigen from 
one side of the cell membrane to the other such that it can stimulate antibody 
production. By way of example, bacterial and viral antigens translocated by Antp in 

15 the cell cytoplasm may be processed and associated with MHC class I molecules. 
This antigen processing and presenting pathway is known to activate specific CD8 
cytoxic lymphocytes. 

2. Gene therapy. Gene therapy may include any one or more of: the addition, 
20 the replacement, the deletion, the supplementation, the manipulation etc. of one or 
more nucleotide sequences in, for example, one or more targeted sites - such as 
targeted cells. If the targeted sites are targeted cells, then the cells may be part of 
a tissue or an organ. General teachings on gene therapy may be found in 
Molecular Biology (Ed Robert Meyers, Pub VCH, such as pages 556-558). 

25 

By way of further example, gene therapy can also provide a means by which any 
one or more of: a nucleotide sequence, such as a gene, can be applied to replace or 
supplement a defective gene; a pathogenic nucleotide sequence, such as a gene, or 
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expression product thereof can be eliminated; a nucleotide sequence, such as a 
gene, or expression product thereof, can be added or introduced in order, for 
example, to create a more favourable phenotype; a nucleotide sequence, such as a 
gene, or expression product thereof can be added or introduced, for example, for 
5 selection purposes (i.e. to select transformed cells and the like over non- 
transformed cells); cells can be manipulated at the molecular level to treat, cure or 
prevent disease conditions - such as cancer (Schmidt-Wolf and Schmidt-Wolf, 
1994, Annals of Hematology 69;273-279) or other disease conditions, such as 
immune, cardiovascular, neurological, inflammatory or infectious disorders; 
10 antigens can be manipulated and/or introduced to elicit an immune response! such 
as genetic vaccination. In a particularly preferred embodiment, Antp may be used to 
introduce functional proteins in the cytoplasm of genetically deficient cell types. 

3. Cancer therapy. Antp may be used to transport into cancer cells molecules 
15 that are transcription factors and are able to restore cell cycle control or induce 
differentiation. For example it is understood that many cancer cells would undergo 
apoptosis if a functional P-53 molecule is introduced into their cytoplasm. The 
present invention may be used to deliver such gene products. 

20 4. Develop antibacterial and antiviral measures. For example, Antp may be used 
to transport in the cytoplasm of infected cells recombinant antibodies or DNA 
binding molecules and which interfere with a crucial step of bacterial and viral 
replication. 



25 



We have now generated recombinant chimeric proteins constituted by the 
homeodomain of antennapedia fused either to the 85A antigen of Mycobacterium 
tuberculosis or to a single chain recombinant antibody directed against the malaria 
antigen Pb21. 
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5. Use in expression systems. For example, it is desirable to express exogenous 
proteins in eukaryotic cells so that they get processed correctly. However, many 
exogenous proteins are toxic to eukaryotic cells. In manufacturing exogenous 

5 proteins it is therefore desirable to achieve temporal expression of the exogenous 
protein. The system may therefore be used in connection with an inducible promoter 
for this or any other application involving such a system. In one such system, the 
loxP/cre system, the eukaryotic cell contains a gene encoding the protein of interest, 
functionally linked to a repressor. Thus, in the normal situation, the protein is not 
10 expressed. The Antp homeodomain may be fused to Cre recombinase which will 
excise the repressor gene. Application of this conjugate to the eukaryotic cell will 
thus cause expression of the protein. 

6. Receptor-mediated signalling. A receptor is a cellular macromolecule that 
1 5 undergoes combination, with a hormone, neurotransmitter, drug or intracellular 

messenger to initate a change in cell function. Receptors are concerned directly and 
specifically in chemical signalling between and within cells. By receptor-mediated 
we mean a response that requires the intermediary action of a receptor. Thus the 
present invention which is required to initate a change in cell function. 

20 

7. Protein sorting. 

8. DNA synthesis. 

25 The invention is particularly important because it transports to the nuclei of cells, 
functional transcription factors which interfere with the endogenous expression of 
genes and/or which interfere with the process of viral or bacterial replication. 
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Suitable NOI coding sequences and POIs include those that are of therapeutic 
and/or diagnostic application such as, but are not limited to: sequences encoding or 
POIs which are cytokines, chemokines, hormones, antibodies, engineered 
inimunoglobulin-like molecules, a single chain antibody, conjugates, enzymes, 
immune co-stimulatory molecules, immunomodulatory molecules, anti-sense RNA, 
a transdominant negative mutant of a target protein, a toxin, suchas endotoxin A, ' 
Colicin A, d-endotoxin, diphtheria toxin, Bacillus anthrox toxin, Cholera toxin, 
Pertussis toxin, E. coli toxins, Shigatoxin or a Shiga-like toxin, a conditional 
toxin, an antigen, a tumour suppressor protein and growth factors, membrane 
proteins, vasoactive proteins and peptides, anti-viral proteins and ribozymes, and 
derivatives therof (such as with an associated reporter group). When included, 
such coding sequences may be typically operatively linked to a suitable promoter, 
which may be a promoter driving expression of a ribozyme(s), or a different 
promoter or promoters. 



15 



The conjugate of the present invention may be used to deliver one or more NOI(s) 
or POI(s) useful in the treatment of the disorders listed in WO-A-98/05635. For 
ease of reference, part of that list is now provided: cancer, inflammation or 
inflammatory disease, dermatological disorders, fever, cardiovascular effects, 
20 haemorrhage, coagulation and acute phase response, cachexia, anorexia, acute 
infection, HIV infection, shock states, graft-versus-host reactions, autoimmune 
disease, reperfusion injury, meningitis, migraine and aspirin-dependent anti- 
thrombosis; tumour growth, invasion and spread, angiogenesis, metastases, 
malignant, ascites and malignant pleural effusion; cerebral ischaemia, ischaemic 
heart disease, osteoarthritis, rheumatoid arthritis, osteoporosis, asthma, multiple 
sclerosis, neurodegeneration, Alzheimer's disease, atherosclerosis, stroke, 
vasculitis, Crohn's disease and ulcerative colitis; periodontitis, gingivitis; 
psoriasis, atopic dermatitis, chronic ulcers, epidermolysis bullosa; corneal 



25 
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ulceration, retinopathy and surgical wound healing; rhinitis, allergic conjunctivitis, 
eczema, anaphylaxis; restenosis, congestive heart failure, endometriosis, 
atherosclerosis or endosclerosis. 

5 In addition, or in the alternative, the conjugate of the present invention may be 
used to deliver one or more NOI(s) or POI(s) useful in the treatment of disorders 
listed in WO-A-98/07859. For ease of reference, part of that list is now provided: 
cytokine and cell proliferation/differentiation activity; immunosuppressant or 
immunostimulant activity (e.g. for treating immune deficiency, including infection 

10 with human immune deficiency virus; regulation of lymphocyte growth; treating 
cancer and many autoimmune diseases, and to prevent transplant rejection or 
induce tumour immunity); regulation of haematopoiesis, e.g. treatment of myeloid 
or lymphoid diseases; promoting growth of bone, cartilage, tendon, ligament and -w* 
nerve tissue, e.g. for healing wounds, treatment of burns, ulcers and periodontal fc 

15 disease and neurodegeneration; inhibition or activation of follicle-stimulating 
hormone (modulation of fertility); chemotactic/chemokinetic activity (e.g. for 
mobilising specific cell types to sites of injury or infection); haemostatic and 
thrombolytic activity (e.g. for treating haemophilia and stroke); antiinflammatory 

activity (for treating e.g. septic shock or Crohn's disease); as antimicrobials; ^ 
20 modulators of e.g. metabolism or behaviour; as analgesics; treating specific 
deficiency disorders; in treatment of e.g. psoriasis, in human or veterinary 

medicine. 1 

In addition, or in the alternative, the conjugate of the present invention may be 
25 used to deliver one or more NOI(s) or POI(s) useful in the treatment of disorders 
listed in WO-A-98/09985 . For ease of reference, part of that list is now provided: 
macrophage inhibitory and/or T cell inhibitory activity and thus, anti-inflammatory 
activity; anti-immune activity, i.e. inhibitory effects against a cellular and/or 

m 
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humoral immune response, including a response not associated with inflammation; 
inhibit the ability of macrophages and T cells to adhere to extracellular matrix 
components and fibronectin, as well as up-regulated fas receptor expression in T 
cells; inhibit unwanted immune reaction and inflammation including arthritis, 
> including rheumatoid arthritis, inflammation associated with hypersensitivity, 
allergic reactions, asthma, systemic lupus erythematosus, collagen diseases and 
other autoimmune diseases, inflammation associated with atherosclerosis, 
arteriosclerosis, atherosclerotic heart disease, reperfusion injury, cardiac arrest, 
myocardial infarction, vascular inflammatory disorders, respiratory distress 
syndrome or other cardiopulmonary diseases, inflammation associated with peptic 
ulcer, ulcerative colitis and other diseases of the gastrointestinal tract, hepatic 
fibrosis, liver cirrhosis or other hepatic diseases, thyroiditis or other glandular 
diseases, glomerulonephritis or other renal and urologic diseases, otitis or other 
oto-rhino-laryngological diseases, dermatitis or other dermal diseases, periodontal 
diseases or other dental diseases, orchitis or epididimo-orchitis, infertility, orchidal 
trauma or other immune-related testicular diseases, placental dysfunction, placental 
insufficiency, habitual abortion, eclampsia, pre-eclampsia and other immune 
and/or inflammatory-related gynaecological diseases, posterior uveitis, 
intermediate uveitis, anterior uveitis, conjunctivitis, chorioretinitis, uveoretinitis, 
optic neuritis, intraocular inflammation, e.g. retinitis or cystoid macular oedema, 
sympathetic ophthalmia, scleritis, retinitis pigmentosa, immune and inflammatory 
components of degenerative fondus disease, inflammatory components of ocular 
trauma, ocular iiiflarnmation caused by infection, proliferative vitreo-retinopathies, 
acute ischaemic optic neuropathy, excessive scarring, e.g. following glaucoma 
filtration operation, immune and/or inflammation reaction against ocular implants 
and other immune and inflammatory-related ophthalmic diseases, inflammation 
associated with autoimmune diseases or conditions or disorders where, both in the 
central nervous system (CNS) or in any other organ, immune and/or inflammation 
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suppression would be beneficial, Parkinson's disease, complication and/or side 
effects from treatment of Parkinson's disease, AIDS-related dementia complex 
HIV-related encephalopathy, Devic's disease, Sydenham chorea, Alzheimer's 
disease and other degenerative diseases, conditions or disorders of the CNS, 
5 inflammatory components of stokes, post-polio syndrome, immune and 

inflammatory components of psychiatric disorders, myelitis, encephalitis, subacute 
sclerosing pan-encephalitis, encephalomyelitis, acute neuropathy, subacute 
neuropathy, chronic neuropathy, Guulaim-Barre syndrome, Sydenham chora, 
myasthenia gravis, pseudo-tumour cerebri, Down's Syndrome, Huntington's 

10 disease, amyotrophic lateral sclerosis, inflammatory components of CNS 

compression or CNS trauma or infections of the CNS, inflammatory components 
of muscular atrophies and dystrophies, and immune and inflammatory related 
diseases, conditions or disorders of the central and peripheral nervous systems, 
post-traumatic inflammation, septic shock, infectious diseases, inflammatory 

15 complications or side effects of surgery, bone marrow transplantation or other 
transplantation complications and/or side effects, inflammatory and/or immune 
complications and side effects of gene therapy, e.g. due to infection with a viral 
carrier, or inflammation associated with AIDS, to suppress or inhibit a humoral 
and/or cellular immune response, to treat or ameliorate monocyte or leukocyte 

20 proliferative diseases, e.g. leukaemia, by reducing the amount of monocytes or 
lymphocytes, for the prevention and/or treatment of graft rejection in cases of 
transplantation of natural or artificial cells, tissue and organs such as cornea, bone 
marrow, organs, lenses, pacemakers, natural or artificial skin tissue. 

25 The present invention also provides a pharmaceutical composition for treating an 
individual by gene therapy, wherein the composition comprises a therapeutically 
effective amount of the conjugate of the present invention. The pharmaceutical 
composition may be for human or animal usage. Typically, a physician will 
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determine the actual dosage which will be most suitable for an individual subject 
and it will vary with the age, weight and response of the particular individual. 

The composition may optionally comprise a pharmaceutical^ acceptable carrier, 
diluent, excipient or adjuvant. The choice of pharmaceutical carrier, excipient or 
diluent can be selected with regard to the intended route of adniinistration and 
standard pharmaceutical practice. The pharmaceutical compositions may comprise 
as - or in addition to - the carrier, excipient or diluent any suitable binder(s), 
lubricant(s), suspending agent(s), coating agent(s), solubilising agent(s), and other 
carrier agents that may aid or increase entry into the target site (such as for 
example a lipid delivery system). 



Where appropriate, the pharmaceutical compositions can be administered by any 
one or more of: inhalation, in the form of a suppository or pessary, topically in the 
15 form of a lotion, solution, cream, ointment or dusting powder, by use of a skin 
patch, orally in the form of tablets containing excipients such as starch or lactose, 
or in capsules or ovules either alone or in admixture with excipients, or in the 
form of elixirs, solutions or suspensions containing flavouring or colouring agents, 
or they can be injected parenterally, for example intracavernosally, intravenously, 
20 intramuscularly or subcutaneously. For parenteral administration, the 

compositions may be best used in the form of a sterile aqueous solution which may 
contain other substances, for example enough salts or monosaccharides to make the 
solution isotonic with blood. For buccal or sublingual administration the 
compositions may be administered in the form of tablets or lozenges which can be 
25 formulated in a conventional manner. 

The delivery of one or more therapeutic genes or proteins according to the invention 
may be used alone or in combination with other treatments or components of the 
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treatment. Diseases which may be treated include, but are not limited to: cancer, 
neurological diseases, inherited diseases, heart disease, stroke, arthritis, viral 
infections and diseases of the immune system. Suitable therapeutic genes include 
those coding for tumour suppressor proteins, enzymes, pro-drug activating enzymes, 
immunomodulatory molecules, antibodies, engineered immunoglobulin-like 
molecules, conjugates, hormones, membrane proteins, vasoactive proteins or 
peptides, cytokines, chemokines, anti-viral proteins, antisense RNA and ribozymes. 

The conjugate may also contain one or more cytokine-encoding NOIs or cytokines. 
Suitable cytokines and growth factors include but are not limited to: ApoE, Apo- 
SAA, BDNF, Cardiotrophin-1, EGF, ENA-78, Eotaxin, Eotaxin-2, Exodus-2, FGF- 
acidic, FGF-basic, fibroblast growth factor-10 (Marshall 1998 Nature Biotechnology 
16: 129).FLT3 ligand (Kimura et al. (1997), Fractalkine (CX3C), GDNF, G-CSF, 
GM-CSF, GF-pl, insulin, IFN-y, IGF-I, IGF-II, IL-la, IL-lp, IL-2, H-3, IL-4, IL-5, 
IL-6, H-7, IL-8 (72 a.a.), IL-8 (77 a.a.), IL-9, IL-10, IL-11, IL-12, IL-13, IL-15, IL- 
16, IL-17, Hrl8 (IGIF), Inhibin a, Tnhibin P, IP-10, keratinocyte growth fector-2 
(KGF-2), KGF, Leptin, LIF, Lymphotactin, Mullerian inhibitory substance, 
monocyte colony inhibitory factor, monocyte attractant protein (Marshall 1998 ibid), 
M-CSF, MDC (67 a.a.), MDC (69 a.a.), MCP-1 (MCAF), MCP-2, MCP-3, MCP-4, 
MDC (67 a.a.), MDC (69 a.a.), MIG, MIP-lo, MIP-ip, MIP-3a, ME»-3p, MIP-4, 
myeloid progenitor inhibitor factor- 1 (MPIF-1), NAP-2, Neurturin, Nerve growth 
factor, P-NGF, NT-3, NT-4, Oncostatin M, PDGF-AA, PDGF-AB, PDGF-BB, PF-4, 
RANTES, SDFla, SDFlp, SCF, SCGF, stem cell factor (SCF), TARC, TGF-a, 
TGF-P, TGF-P2, TGF-P3, tumour necrosis factor (TNF), TNF-a, TNF-P, TNIL-1, 
TPO, VEGF, GCP-2, GRO/MGSA, GRO-p, GRO-y, HCC1, 1-309, 

The conjugate of the present invention may comprise further suitable domains. These 
will be known to those skilled in the art. For example an endoplasmic reticulum 
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retention signal functions to affect the intracellular routing of the internalized 
conjugate or protein/nucleic acid complex of the present invention. A suitable 
endoplasmic retention signal may be a mammalian endoplasmic reticulum retention 
signal. 

5 

Also present may be a translocation domain which serves to enhance nucleic acid or 
protein escape from the cellular vesicle system and thus to augment nucleic acid 
transfer by this route. This domain may serve to reduce or avoid lysomal degradation 
after internalization of the protein/nucleic acid into the target cell. Suitable 
1 0 translocation domains are derivable from toxins, particularly bacterial toxins, such as 
exotoxin A, Colicin A, d-endotoxin, diphtheria toxin, Bacillus anthrax toxin, Cholera 
toxin, Pertussis toxin, E. coli toxin toxins, Shigatoxin or Shiga-like toxin. 

The first domain of the present invention may be modified to target cell sites other 
15 than the nucleus. 



20 



25 



Additionally, or alternatively, also present may be a target cell-specific binding 
domain recognising a cell surface structure, such as a receptor protein or surface 
antigen on the target cell. 

Nucleic acids. 

The invention also provides nucleic acid encoding the conjugates of the invention. 
These may be constructed using standard recombinant DNA methodologies. The 
nucleic acid may be RNA or DNA and is preferably DNA. Where it is RNA, 
manipulations may be performed via cDNA intermediates. Generally, a nucleic acid 
sequence encoding the first region will be prepared and suitable restriction sites 
provided at the 5' and/or 3' ends. Conveniently the sequence is manipulated in a 
standard laboratory vector, such as a plasmid vector based on pBR322 or pUC 1 9 (see 
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below). Reference may be made to Molecular Cloning by Sambrook et al (Cold 
Spring Harbor, 1989) or similar standard reference books for exact details of the 
appropriate techniques. 

5 Nucleic acid encoding the second region may likewise be provided in a similar vector 
system. Sources of nucleic acid may be ascertained by reference to published 
literature or databanks such as Genbank. 

Nucleic acid encoding the desired first or second region may be obtained from 
10 academic or commercial sources where such sources are willing to provide the 
material or by synthesising or cloning the appropriate sequence where only the 
sequence data are available. Generally this may be done by reference to literature 
sources which describe the cloning of the gene in question. 

15 Alternatively, where limited sequence data are available or where it is desired to 
express a nucleic acid homologous or otherwise related to a known nucleic acid, 
exemplary nucleic acids can be characterised as those nucleotide sequences which 
hybridise to the nucleic acid sequences known in the art. 

20 Stringency of hybridisation refers to conditions under which polynucleic acids 
hybrids are stable. Such conditions are evident to those of ordinary skill in the field. 
As known to those of skill in the art, the stability of hybrids is reflected in the 
melting temperature (Tin) of the hybrid which decreases approximately 1 to 1.5°C 
with every 1% decrease in sequence homology. In general, the stability of a hybrid is 

25 a function of sodium ion concentration and temperature. Typically, the hybridisation 
reaction is performed under conditions of higher stringency, followed by washes of 
varying stringency. 
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As used herein, high stringency refers to conditions that permit hybridisation of only 
those nucleic acid sequences that form stable hybrids in 1 M Na+ at 65-68 °C. High 
stringency conditions can be provided, for example, by hybridisation in an aqueous 
solution containing 6x SSC, 5x Denhardt's, 1 % SDS (sodium dodecyl sulphate), 0.1 
5 Na+ pyrophosphate and 0.1 mg/ml denatured salmon sperm DNA as non specific 
competitor. Following hybridisation, high stringency washing may be done in 
several steps, with a final wash (about 30 min) at the hybridisation temperature in 0.2 
-O.lxSSC, 0.1 %SDS. 

10 Moderate stringency refers to conditions equivalent to hybridisation in the above 
described solution but at about 60-62°C. In that case the final wash is performed at 
the hybridisation temperature in lx SSC, 0.1 % SDS. 

Low stringency refers to conditions equivalent to hybridisation in the above 
15 described solution at about 50-52°C. In that case, the final wash is performed at the 
hybridisation temperature in 2x SSC, 0.1 % SDS. 

It is understood that these conditions may be adapted and duplicated using a variety 
of buffers, e.g. formamide-based buffers, and temperatures. Denhardt's solution and 

20 SSC are well known to those of skill in the art as are other suitable hybridisation 
buffers (see, e.g. Sambrook, et ai, eds. (1989) Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory Press, New York or Ausubel, et a!., eds. 
(1990) Current Protocols in Molecular Biology, John Wiley & Sons, Inc.). Optimal 
hybridisation conditions have to be determined empirically, as the length and the GC 

25 content of the probe also play a role. 

Expression vectors and host celte 
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llie nucleic acid encoding a conjugate according to the invention, or constituent 
part(s) thereof, can be incorporated into vectors for further manipulation. As used 
herein, vector (or plasmid) refers to discrete elements that are used to introduce 
heterologous DNA into cells for either expression or replication thereof. Selection 
and use of such vehicles are well within the skill of the artisan. Many vectors are 
available, and selection of appropriate vector will depend on the intended use of the 
vector, i.e. whether it is to be used for DNA amplification or for DNA expression, 
the size of the DNA to be inserted into the vector, and the host cell to be transformed 
with the vector. Each vector contains various components depending on its function 
(amplification of DNA or expression of DNA) and the host cell for which it is 
compatible. The vector components generally include, but are not limited to, one or 
more of the following: an origin of replication, one or more marker genes, an 
enhancer element, a promoter, a transcription termination sequence and a signal 
sequence. 

Production of Antibodies. 

Conjugates according to the invention may be used directly as immunogens, without 
the use of further adjuvants, to generate antisera and monoclonal antibodies. 



In accordance with yet another embodiment of the present invention, there are 
provided antibodies specifically recognising and binding the conjugates according to 
the invention. More preferably, however, the antibodies are specific for the second 
region of the conjugates, that is the polypeptide which is fused to the gene product of 
25 the invention in order to achieve expression thereof. Advantageously, the second 
region of the conjugate is recognised by the antibodies when in its natural context. 
Thus, where the second region is an isolated peptide or domain from a larger protein, 
that peptide or domain is recognised by the antibodies of the invention in the context 
of the whole of the larger protein. 



WO 99/11809 



PCT/GB98/02628 



26 



The invention moreover provides a method for preparing an immunoglobulin, 
comprising the steps of: 



5 a) immunising an animal with a conjugate according to the present invention : 
and 

b) recovering immunoglobulin specific for a region of the conjugate from the 
serum of the animal. 

10 The antibodies (or immunoglobulins) may be isolated in the form of a crude 
preparation, i.e. an antiserum, by affinity chromatography against the conjugate. 

Expression and p urification of^jgfB^onj ugates 

15 Whilst not wishing to be bound by any theory we believe the ability of Antp- 
conjugates to translocate across the cell surface membrane may be very much 
dependent on the conformation of the recombinant proteins. We could not observe 
any translocation of the polypeptide by using either bacteria cell extracts or purified 
proteins exposed to small amounts of detergent (ionic and non-ionic) or denaturating 
20 agents (urea or guanidinuim). This conformation dependent property has represented 
a senous problem for using ^-conjugates in both in vitro and in vivo experiments 
as all reports so far published use either synthetic peptides or bacterial cell extracts 
We have overcome this limitation by purifying the Antp-conjugate under native 
conditions in accordance with the invention. 



25 



Example 1 - We have introduced at the carboxyl-terminal end of the recombinant 
protein a tail of six histidine which confers to those proteins a high affinity for nickel 
ions. Bacteria cells expressing the recombinant proteins were lysed by repeated 
exposure to ultrasounds. Cell lysates were centrifuged and loaded on a nickel 
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column (Quiagen). The protein bound to the column were eluted step wise by 
changing the pH of the buffer. Under these conditions the ^nfp-conjugates could be 
selectively enriched. We have estimated by gel electrophoresis that the Antp- 
conjugates contributed up to 75% of the material eluted from the nickel column in 

5 the pH range between 4.0 to 5.0. The Antp recombinant proteins were further 
purified by affinity chromatography using the antibody 4 D 1 1 directed against the 
histidine tail. The recombinant proteins were eluted from the column by changing 
the pH of the buffer. Our results indicate that fractions eluted at pH 2.8 contained 
the recombinant proteins nearly free of any bacterial contaminants. These protein 

10 fraction were then loaded on a polymixine column to selectively subtract LPS and 
related bacterial contaminants. Gel electrophoresis analysis indicated that the eluate 
of the polymixine column contained the recombiant protein devoid of bacterial 
contaminants (99.5% purity). Purified ,4/j/p-conjugates were tested for their ability to 
translocate across the cell surface membrane of different cell types. Our results 

15 indicate that the purified conjugates Antp-85A are able to translocate across a variety 
of cell lines (Hela, Hep-G2, P815 EL4) and human monocytes. The ability of Antp 
to vehiculate large molecules across the cell membrane was not affected by the 
particular aminc-acid composition of the conjugates. A different Antp-fusion 
molecule, generated by using the malaria antigen Thrombospondin related 

20 anonymous protein (TRAP) of Plasmodium berghei of about 150 amino acids 
(Robson KJ et al Mol Biochem Parasitol 1997; 84(1): 1-12), was translocated across 
the cell membrane as efficiently as the 85A constructs. 

The monoclonal antibody 4 Dl 1 
25 The monoclonal antibody (mAb) 4 D 11 was found by screening in ELISA 

hydridomas generated from a mouse that was immunised with a recombinant protein 
containing a six histidine tail at its amino-terminal end. The antibody is available 
from Imperial College of Science, Technology and Medicine, Sherfield Building 
Exhbition Rd, London SW7 2AZ, UK c/o ic Innovations Ltd, 47 Princes Gate, 
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London SWT 2AZ, UK. A molecular characterisation of the epitope showed that this 
mAb recognises the amino acid sequence HHHHHHGS both at the amino and at the 
carboxyl terminal end of recombinant proteins. The antibody has an IgGl isotype 
and can be easily purified on protein A column. Our results indicate that 4 D 11 
recognises the recombinant proteins containing the HHHHHHGS in EUSA 
immunoblot, immuno-fluorescence. In addition purified 4 D 11 coupled to beads 
(affi-gel or CNBR activated sepharose) can be used to purify recombinant proteins 
under native conditions. 



10 TBC Vaccine 



The present invention may be used to provide a recombinant vaccine that is 
able to induce a humoral as well as a ceil mediated helper and cytotoxic immune 
response against M. tuberculosis antigens. In order to elicit a MCH class I restricted 
15 cytotoxic immune response, the antigens employed in the vaccine should have access 
to the cytoplasmic compartment in the presenting cells of the immunised organism 
The 85A protein is one of the member of the antigenic complex 85 ABC and 
represents the most abundant molecular species secreted M. tuberculosis in the 
culture media and during bacterial infection. Moreover, the protective immune 
20 response induced natural infection is mainly directed against the antigen 85A this 
protein elicits the production of antibodies and cytokines and stimulate the cytotixic 
T cells. This would reproduce the processing and presentation pathways occurring 
dunng the natura! infection of M. tuberculosis and the immunisation procedures 
employing BCG. Whilst not wishing to be bound by theory, it is believed that the 
25 development of such a vaccine would require molecular vehicles for translocating 
bacterial proteins across the cell membrane into the cytoplasmic compartment. 

Example 2 - To demonstrate the possibility of using Antp as delivery system we have 
earned out the following experimental activities: (i) cloning of the 85A full length 
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sequence in expression plasmids; (ii) development of synthetic genes encoding 
chimeric proteins in which the homeodomain of Ant has been inserted at the amino 
terminal end of the proteins 85A; (iii) expression in E. coli and purification of the 
recombinant proteins; (iv) development of a specific antiserum against 85A; and 
5 (v) analysis of the ability of Antp to vehiculate the mycobacterial polypeptides 85A 
into the cytoplasm of HeLa cell in culture. 



A Honing of 85 A coding sequence in th e expression plasmid pDS56/RBSII 
The Ml length 85A gene was amplified in PCR experiments using as template DNA 

10 extracted M. tuberculosis strain H37Rv. As primers for the PCR reaction we have 
used oligonucleotides deduced from the arnino acid sequence of 85A. To facilitate 
cloning in the expression vectors the primer were designed to contain at their 5' end 
the restriction sites Bam HI and Sal I. The amplified sequence was cloned in the 
expression plasmid pDS56/RBSII thereby generating the plasmids pDS/85A. The 

15 amplified sequences 85A was sequenced to rule out the presence of mutations 
introduced during the amplification reaction. The expression unit of the plasmid 
pDS 56 RBSH is under the control of IPTG inducible promoter and yields conjugates 
containing a stretch of six bistidines at their amino- or carboxyl-terrninal end. The 
presence of the histidines confer to the proteins a high affinity for nickel ions. The 

20 recombinant protein 85A was expressed in E. coli and purified by affinity 
chromatography on nickel column. The yields and quality of the proteins recovered 
at the end of purification process were assessed by analysing the samples eluted from 
the affinity column with SDS acrylamide gel electrophoresis. 



25 g Development of synthetic genes encoding th e chimeric proteins Antp-85 A 

The sequence encoding the homeodomain of Antennapedia was amplified in PCR 
experiments using as template the DNA extracted from embryonic cells of D. 
melanogaster. The homeodomain encompasses the sequence of Antp from amino 
acid 297 to 356 and is encoded by a gene sequence that is not interrupted by introns. 
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As primers we have employed oligonucleotides deduced on the basis of the 
homeodomain sequences and containing the Bam HI restriction site at their 5' end. 
The PCR product was sequenced to rule out the presence of mutations introduced by 
PCR in the sequence of the homehodomain. The restriction site Bam HI allowed the 
cloning of the PCR product in the correct reading came in front of the sequence of 
85A in the plasmid P DS/85A encoded me chimeric protein Antp-85A. This protein is 
characterised by the presence of the homeodomain of Antp and the hisidine tail at its 
amino- and carboxyl-terminal ends respectively. The sequence of Antp was 
introduced at the ammo-terminal end to facilitate the coirect folding of this domain 
in the context of chimeric protein sequences. The expression of the chimeric protein 
Antp-85A was assessed by analyising the bacterial lysate on SDS acrylamide gels. In 
immunoblot experiments the serum S-85A recognised molecules migrating with the 
molecular weight predicted for Antp-85A. 

15 ^ Functional analysis of the chimeric p mt qjns Antp -85A 

To assess the ability of the chimeric protein to translocate across the cell membrane 
increasing amounts of purified Antp-85A were added to HeLa cells in culture. As 
control the cells were incubated with recombinant protein 85A that lacks the 
homeodomain of Antp at its ammo-teiminal end. After 3 hours of incubation at 
20 37°C. the cells were fixed in formaldehyde and analysed with the antiserum s-85A to 
reveal the translocation of 85A epitopes into the cytoplasm. In immunofluorescence 
experiments the serum s-85A did not react with HeLa cells that were previously 
incubated with bacterial lysates containing 85A. This finding would indicate that 
these polypeptides are not able to cross the cell membrane. On the contrary the 
25 antiserum s-25 showed a clear cytoplasmic reactivity on HeLa cells that were 
incubated with the chimeric protein Antp-85A. This last result strongly indicated 
that the 85A epitopes were localised within the cytoplasm of cells incubated with 
Antp-85A thus indicating that the homeodomain of Antp has conferred to the 
chimeric proteins the ability to translocate across the superficial membrane. These 
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findings all together would indicate that the strategy employed to vehiculate antigens 
into the cytoplasm across the cell membrane can be used to develop an experimental 
vaccine against tuberculosis. It is anticipated that the purified chimeric proteins 
Antp-85A may elicit in the immunised animals both a humoral and cell mediated 
5 helper and cytotoxic immune response. 

p^ pl* i, . P/cre reguM ^ ex pression nf proteins in eukaryotic cells 

The Cre recombinase is a phage derived enzyme that cuts double strand DNA at 
10 specific sited (LoxP). When two of these sites are present in opposite orientation the 
intervening DNA sequence is excised. Hie Cre recombinase has been shown to work 
in distantly related organisms such as bacteria, yease and mammalian cells. Antp 
may be used to deliver the Cre recombines inside the cell nucleus and for 
manipulating genomic DNA at precise locations in a temporal regulated manner. 
15 One of the possible applications of Antp-Cre fusion is the development off highly 
regulated systems for expressing genes in eukaryotic cells. Transformation vectors 
can be designed for expressing genes under the transcriptional control of a promoter 
containing, within two LoxP sites, DNA elements functioning as target sequences for 
transcription repressor factors. In the absence of Cre recombinase the presence of the 
20 repressor sequence would not allow the promotor to transcribe the genes cloned in 
the vector. By adding the Antp-Cre conjugate to cells the repressor sequence may be 
excised from the promotor thus allowing transcription to start. This type of 
expression vector may be extremely useful for studying the function of eukaryotic 
genes as well as for expressing biologically active molecule in large amounts. 

25 

Pvam pl* 4 Control of viral expression hv antennanedia delivery of recombinant 
transcri ption factors 
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We further investigate^ delivery technology by assessing its ability to transport 
a functional form of the suppressive transcription factor Oct-2.4 into the nuclei of 
cultured cells. This factor is a member of the POU (Pit-Oct-Unc) family of 
transcription factors and originates from an alternative splicing of the Oct-2 gene 
5 transcript which only occurs in neuronal cells. 

Oct-2.4 lacks the activation domain located at the C-tenninal end of Oct-2 though 
it retains the ability to bind to the octamer DNA regulatory element AGTCAAT. In 
contrast to Oct-2, which acts as a positive regulatory factors on B lymphocytes 
10 promoters, Oct-2.4 can function as transcription repressor . Notably, Oct-2.4 was 
shown to suppress the transcription of the herpes simplex virus (HSV) immediate 
early (IE) genes by binding to an octamer related regulatory element in the IE genes 
promoter. Moreover, the ectopic expression of Oct 2.4 in HSV permissive cells 
down regulated the IE gene expression and inhibited the viral lytic cycle thus 
15 indicating that this factor may play an important function in detenrnning the 
resistance of neuronal cells to HSV replication. Recombinant conjugates 
containing the homeodomain of Antp at the animo-terminal end of Oct-2.4 cross the 
cell surface membrane , reach the cell nuclei and down regulate the viral IE 
promoters. 

20 

This proposal has the objective to assess the ability of hAntp to transport Oct-2.4 
across the surface membrane of living cells. To achieve this objective we: i) generate 
hAntp -Oct-2.4 fusion proteins; ii) assess the ability of the fusion proteins to bind to 
HSV IE promoters and to translocate across the cell membrane; iii) analyse the 
25 transcriptional activity of IE promoter constructs in cells treated with hAntp -Oct-2.4 
fusion proteins. The experiments prove the feasibility of manipulating the expression 
of cellular and viral genes by using a non invasive protein delivery system and may 
lead to the development of a novel class of antiviral agents. 



WO 99/11809 



PCT/GB98/02628 



33 

Methods 

DevsioEmgnt of Mg / g constructs: Oct-2.4 is amplified in PCR experiments 

using as template cDNA generated from human B lymphocytes mRNA. These cells 
have been previously shown to express the full length Oct-2 transcription factor. 

5 The Oct-2.4 PCR product is sequenced and used to generate constructs expressing 
recombinant proteins in E. Coli. The sequence encoding the homeodomain of Antp 
from amino acid 297 to 356 has been amplified from Drosophila DNA. This 
sequence has been modified by in situ mutagenesis to abolish its DNA binding 
properties and shown to translocate across the surfece membrane of living cells. The 

10 hAntp sequence is cloned at the 5' end of the Oct -2.4 gene thus generating the 
construct hAntp -Oct-2.4 that will contain hAntp at the ammo-terminal end of Oct- 
2.4. hAntp -Oct-2.4 has a molecular weight of about 40 kDa which fells within the 
size range of proteins translocated by the homeodomain of Antp. The functional 
properties of Oct-2.4 will not be affected by the insertion of the hAntp sequence . 

15 Structural analysis indicates that the suppressive and the DNA binding activity of 
Oct-2.4 are mediated by a 42 amino acid sequence (residues 141 -182) and the 
POU domain respectively which both retain their function in different structural 
contexts as well as in chimeric proteins. We also generate the construct hAntp- 
AOct-2.4 which has been designed to encode a deleted Oct-2.4 factor 

20 encompassing the suppressive 42 amino acid sequence and the POU DNA binding 
domain. L4n/p-AOct-2.4 will retain its suppressive transcriptional activity and 
translocate across the cell membrane more efficiently than hAntp-Oc\-2A because of 
its reduced molecular size 

25 Pv prEssion of Qr.f-7. 4 and hAntn-Oct- 2.4 chi ™"r. proteins in E. coli: hAntp-Oot- 
2.4, li/ln/p-AOct-2.4 and Oct-2.4 are expressed in E. coli using a vector that inserts 
a six histidine tail at their carboxyl-terminal ends. The presence of the His tail will 
allow the purification of the recombinant proteins by nickel chelate chromatography. 
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Experimental evidence indicates that proteins with this structural organisation are 
efficiently transported across the cell membrane. 

To avoid the accumulation of incorrectly folded proteins we insert , at the amino- 
terminal end of L4„^Oct-2.4, h^p-AOct-2.4 and Oct-2.4 the signal sequence of 
Pel B which directs the secretion of recombinant proteins into the bacteria 
periplasm. This strategy has proven to be very convenient to recover functional 
heterologous proteins in & coli. The expression of recombinant proteins in K coli 
are assessed in immunoblot experiments by using the MAb 4D1 1 which recognises 
the histidine tail. 



10 



Membrane translocation and DNA binding nmp prty analysis: The DNA binding 
properties of the recombinant proteins h^-Oct-2.4, L4„/p-AOct-2.4 and Oct-2.4 
are assessed in gel retardation assays using oligonucleotides encompassing the 
natural and the related HSV octamer sequence. The specificity of the interaction will 
15 be investigated in competition assays using wild type and mutated octamer 

oligonucleotides. The hAntp-Oct-2.4 constructs are further investigated for their 
ability to cross the surface membrane of several cell lines including the BHK cells 
(ATCC: CRL-1632) which are susceptible to HSV infection and lack Oct-2 
transcription factors. BHK cells are incubated with affinity purified hAntp-Oct-2.4, 
20 hAntp-AOct-2. Control experiments are carried out by incubating the cells with 

recombinant Oct-2.4 and unrelated bAntp-faswa proteins. The presence and the ' 
distribution ofthe recombinant constructs in the treated cells are analysed in 
immunofluorescence experiments. BHK cells incubated with Mntp-Oct-IA, hAntp- 
AOct-2.4 and Oct-2.4 are fixed, permeabilised and processed for 
25 immunofluorescence using MAb 4 Dl 1 

Analysis ofthe transcriptional activity ^MsiB^M gndJU&B^Qct 2 1- The 
suppressive transcriptional activity of hAntp-Oct-2.4 and li^AOct-2 are assessed 
on BHK cells transformed with a HSV IE promoter reporter construct. A DNA 
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sequence encompassing the IE gene 3 promoter from nucleotide -330 to +33 will 
be linked to the luciferase (Luc) gene in the transformation construct E3-Luc. The 
sectedDNA sequences contains theoctamer related regulatory sequence 
TAATGARAT and is transcriptionally active in BHK cells. The E3-Luc construct 
also contains a G418 resistance gene (mo) transcription unit consisting of: i) the 
MMTV-LTR retroviral promoter-ii) the mo gene- and iii) the SV40 splicing and 
polyadenylation sites. Transformed BHK cell clones are selected with geneticin 
(G418) and analysed by southern blot. Experimental evidence indicates that the HSV 
IE3 promoter will constitutively express high level of luciferase inIE3-Luc 
transformed BHK cells. To assess the ability of Mntp-Oct-IA, hAntp-AOct-2.4 to 
suppress the transcriptional activity of the HSV IE3 promoter, IE3-Luc transformed 
BHK cells are cultured either in the absence or in the presence of different 
concentration of affinity purified hAntp chimeric proteins. As control the 
transformed cells are incubated with recombinant Oct-2.4 and unrelated hAntp 
15 fusion proteins. The transcriptional level of the HSV E3 promoter are assessed by 
measuring the luciferase activity in BHK cell extracts collected at different time 
points after treatment Functional h^p-Oct-2.4, Mntp-AOct-lA chimeric protein 
translocated into the cell nucleus of IE3-Luc transformed BHK will significantly 
down regulate the baseline activity of the HSV IE3 promoter. 



10 



20 



The relevance of the proposed experiments goes beyond the development a delivery 
system potentially able to interfere with the lytic cycle of HSV. The homeodomain of 
Antp has the ability to transport across the cell surface membrane a range of 
functional proteins of great biological and medical relevance. 



25 
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Claims 

1. A conjugate comprising: 

(a) a first region comprising the homeodomain of antennapedia or a variant 
thereof; and 

5 (b) a second region not naturally associated with the first region; 
and wherein at least the first region is non-denatured. 

2. A conjugate according to claim 1 wherein the first and second regions are 
associated via a disulfide bond. 

10 

3. A conjugate according to claim 1 or claim 2 wherein the second region 
comprises an NOL 

4. A conjugate according to claim 1 in the form of a fusion protein. 

15 

5. A conjugate according to claim 4 wherein the second region is a functional or 
regulatory protein. 

6. A conjugate according to claim 4 wherein the second region is an antigen. 

20 

7. A conjugate according to claim 4 wherein the second region is a DNA 
binding domain. 

8. A conjugate according to claim 7 wherein the second region further 
25 comprises an NOL 

9. A conjugate in the form of a fusion protein comprising: 

(a) a first region comprising the homeodomain of antennapedia or a variant 
thereof; and 
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(b) a second region not naturally associated with the first region comprising a 
protein of at least 100 amino acids. 

10. Nucleic acid encoding for the conjugate of any preceding claim. 

n . An expression vector comprising the nucleic acid of claim 10 operably linked 
to a promoter. 

12. A host cell transformed with the expression vector of claim 1 1 . 

13 A method for preparing a conjugate comprising: 
(i) * culturing the host cell according to claim 12 under conditions which provide 
for the expression of the conjugate from the expression vector within the host cell; 

15 ^ recovering the conjugate, which method comprises fusing an amino acid tail 
to the carboxy terminal end of the conjugate, which tail is capable of binding to a 
first substrate and not to a second substrate, and wherein the conjugate is caused to 
bind via the tail to the first substrate such that components of the host cell do not 
bind to the first substrate; and the conjugate is contacted with the second substrate 
such that the conjugate is not bound and remaining components of the host cell are 
bound to the second substrate. 



20 



14. A method for preparing a conjugate comprising: 
(i) culturing a host cell, transformed with an expression vector comprising 
25 nucleic acid, operably linked to a promoter, encoding (a) a first region comprising the 
homeodomain of antennapedia or a mutant thereof; and (b) a second region not 
naturally associated with the first region, under conditions which provide for 
expression of the conjugate from the expression vector within the host cell; and 
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(ii) recovering the conjugate, which method comprises comprises fusing an 
amino acid tail to the carboxy terminal end of the conjugate, which tail is capable of 
binding to a first substrate and not to a second substrate, and wherein the conjugate ij 
caused to bind via the tail to the first substrate such that components of the host cell 
do not bind to the first substrate; and the conjugate is contacted with the second 
substrate such that the conjugate is not bound and remaining components of the host 
cell are bound to the second substrate. 



15. A conjugate prepared according to the method of claim 13 or claim 14. 

1 6. A phannaceutical composition comprising the conjugate of any one of claims 
1 to 9 or claim 1 5 in combination with a pharmaceutically acceptable carrier. 

17. A phannaceutical composition according to claim 16 in the form of a vaccine. 

18. Use of the conjugate of any one of claims 1 to 9 or claim 15 in the 
preparation of a medicament for the treatment or prevention of cancer, a genetic 
disease and bacterial or viral infections. 

17. A conjugate according to any one of claims 1 to 9 or claim 15 for use in an 
expression system. 
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SEQUENCE LISTING 

(1) GENERAL CJFQKMATIC2J; 

(1) APPLICANT: 

(W HAKE: Imperial College of Science Technology end 
Heal cine 

tej oomninrs ok 

(F) POSTAL CODE (ZIP), SWT 2A2 
fii) TITLE OF INVENTION: Fusion Protein 
(Hi) BOMBER OF SEQUENCES: 1 
(lv) COMPUTER READABLE FORM- 

2 0,1 PC compatible 

(01 SOFTWARE, Patentln Release #1.0. Version tl.zs (EPO) 
(2) INFORMATION FOR SEQ ZD BO: 1- 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: SO amino acids 

(B) TYPE: amino acid 
(Dl TOPOLOGY, linear 

(ii) MOLECULE TSPE: protein 



(Xi) SEQUENCE DESCRIPTION: SEQ ZD NO: Is 

Arg Lya Arg Gly Arg Gla Thr Tyr Thr Arg Tyr Gin Thr Leu Glu Leu 
5 io IS 

Glu Lye Glu pje Hi 3 Phe Aen Arg Tyr Leu Thr Arg Arg Arg Arg a. 

25 3 o 

Glu Zlo Ala Hla Ala r-,, , 

nxa Ala Leu Cys Leu Thr Glu Arg Gin He Lys He Trp 
40 4s 

Phe Gin Asn Arg Arg Met: Lye Trp Lys Ly» Glu Aen 

55 60 
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